Deep Reinforcement Learning-Based Optimization for End-to-End Network Slicing With Control- and User-Plane Separation
نویسندگان
چکیده
Control- and user-plane separation (CUPS) network slicing are two key technologies to support increasing traffic diverse wireless services. However, the benefit of CUPS in decoupling coverage data service functions has not been fully utilized facilitate slicing. In this paper, we present a novel CUPS-based end-to-end (CUPS-E2E) scheme. First, base stations (BS) classified into control BSs (CBS) that provide plane (CP) (TBS) deliver user (UP) traffic. Next, upon CBSs TBSs being virtualized, define four typical (E2E) slices: one for CP coverage, high-throughput services, computation-intensive other delay-sensitive The utilities E2E slices defined based on their throughput, computing capability delay requirements, respectively. Then, deep deterministic policy gradient (DDPG)-based algorithm is employed maximize long-term sum-utility by jointly optimizing allocation communication resources activation virtual TBSs, while meeting requirements all users. Simulation results show our proposed CUPS-E2E scheme conjunction with DDPG-based maximization can wide-coverage massive access as well UP high-throughput, services simultaneously, outperforms existing schemes terms sum-utility, percentage, throughput delay.
منابع مشابه
End-to-End Deep Reinforcement Learning for Lane Keeping Assist
Reinforcement learning is considered to be a strong AI paradigm which can be used to teach machines through interaction with the environment and learning from their mistakes, but it has not yet been successfully used for automotive applications. There has recently been a revival of interest in the topic, however, driven by the ability of deep learning algorithms to learn good representations of...
متن کاملEnd-to-End Optimization of Task-Oriented Dialogue Model with Deep Reinforcement Learning
In this paper, we present a neural network based task-oriented dialogue system that can be optimized end-to-end with deep reinforcement learning (RL). The system is able to track dialogue state, interface with knowledge bases, and incorporate query results into agent’s responses to successfully complete task-oriented dialogues. dialogue policy learning is conducted with a hybrid supervised and ...
متن کاملTVM: End-to-End Optimization Stack for Deep Learning
Scalable frameworks, such as TensorFlow, MXNet, Caffe, and PyTorch drive the current popularity and utility of deep learning. However, these frameworks are optimized for a narrow range of server-class GPUs and deploying workloads to other platforms such as mobile phones, embedded devices, and specialized accelerators (e.g., FPGAs, ASICs) requires laborious manual effort. We propose TVM, an end-...
متن کاملEnd-to-end LSTM-based dialog control optimized with supervised and reinforcement learning
This paper presents a model for end-toend learning of task-oriented dialog systems. The main component of the model is a recurrent neural network (an LSTM), which maps from raw dialog history directly to a distribution over system actions. The LSTM automatically infers a representation of dialog history, which relieves the system developer of much of the manual feature engineering of dialog sta...
متن کاملTowards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning
This paper presents an end-to-end framework for task-oriented dialog systems using a variant of Deep Recurrent QNetworks (DRQN). The model is able to interface with a relational database and jointly learn policies for both language understanding and dialog strategy. Moreover, we propose a hybrid algorithm that combines the strength of reinforcement learning and supervised learning to achieve fa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Vehicular Technology
سال: 2022
ISSN: ['0018-9545', '1939-9359']
DOI: https://doi.org/10.1109/tvt.2022.3191882